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1. Introduction 

1.1. A lacunary sequence is a sequence of integers a(x), x = 1,2,... 
which satisfies the "gap condition" 

lim inf - - > 1 . 

a(x) 

A primary example is to take an integer g > 2 and set a(x) = g x . 

As is true for any increasing sequence of integers, for almost every a 
the fractional parts aa(x) are uniformly distributed modulo 1. More- 
over, for lacunary sequences, it has long been known that the fractional 
parts of aa(x) have strong randomness properties. For instance, the 
exponential sums ^= J2 x <n cos(27raa(x)) have a Gaussian value dis- 
tribution as iV — > oo (see the survey in 0). 

In this paper, we show that lacunary sequences have additional fea- 
tures in common with those of random sequences, which is the as- 
ymptotic distribution of spacings between elements of the sequence: 
Given a sequence {9 n } C [0,1), the nearest-neighbor spacing distri- 
bution is defined by ordering the first N elements of the sequence: 
0i,n < @2,n < • ■ • < @n,n, and then defining the normalized spacings 
to be 

8W : = N(9 n+ljN - 9 n , N ) . 

The asymptotic distribution function of {5^}^ =1 is level spacing dis- 
tribution -Pi(s), that is for each interval [a, b] we require that 

hml#{n < N : 5i N ) E [a,b]} = f P 1 (s)ds . 

The statistical model we have in mind is the "Poisson model" , of a 
sequence generated by uncorrelated levels (i.i.d.'s). In that case P(s) = 
e~ s . Moreover in that model one knows the behavior of all other local 
spacing statistics, such as for instance: 
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1. Instead of spacings between nearest neighbors, one can consider 
spacings between next-to-nearest neighbors or more generally for 
any fixed a > 1, set 

c := N(ez\ - e } ) ■ 

Let P a (s) be the limiting distribution function of {Si^J} as N — > 
oo. In the Poisson model, 

„a— 1 

2. For fixed r > 1 consider the joint distribution of the nearest neigh- 
bor spacings (5^\ 5^} 1 , . . . , 5 ( r ^ r _ 1 ). In the Poisson model, these 
are independent and so the distribution function is Y\i=i e ~ Si ■ 

3. For fixed A > 0, consider the probability of finding exactly k 
elements of the sequence {6 n : n < N} in a randomly chosen 
interval of length X/N. In the Poisson model, this probability is 

1.2. Results. The principal result of our paper asserts that 

Theorem 1.1. Let a(x) be a lacunary sequence. Then for almost all 
a, the fractional parts of the sequence {aa(x)} has all its local spacing 
measures as those of the Poisson model. 

As is well known, all local spacing measures are determined by the 
correlation functions, which measure the distribution of spacings be- 
tween tuples of elements, not necessarily neighboring. To define the 
/c-level correlation function, for x = (x±, . . . ,Xk), denote by A(x) the 
difference vector 

A(x) = {a(x 1 ) - a(x 2 ), a(x fe _i) - a(x k )) . 

Take a smooth, compactly supported function / G C^°(R fc_1 ), and set 

Fn(v):= f(N(m + y)). 

We then define the /c-level correlation sum associated to this data by 
(1.1) R k (f,N)(a) :=^J2* <n Fn(^(x)) 

iV z — <Xi<N 

where Yl* m eans the sum over all vectors with distinct components: 
Xi 7^ Xj if % ^ j. Our main result is: 

Theorem 1.2. There is a set of a of full measure so that for all k > 
2 and all test functions f G C^°(R A:_1 ) ; the k-level correlation sums 
Rk(f, N)(a) converge to f f(x)dx. 
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By standard results, this implies Theorem |1 . 1| . The case of pair 
correlation (k = 2) was done in [|Kj . 



1.3. Comparison with polynomial sequences. Much of the work 
done previously on spacings of fractional parts was for polynomial se- 
quences, such as a(x) = x 2 see also [0, |TTJ. Rudnick and 



Sarnak M proved the analogue of Theorem [I.2| for the pair correlation 



function (k = 2). However, the method used both in || and here, 
which proves almost-everywhere convergence by going through conver- 
gence in L 2 , already fails in the case of a(x) = x 2 at the level of triple 
correlation, because the variance diverges as N — > oo. 

The reason for the difference between these two cases can be under- 
stood by examining the number of solutions of the equation 

(1.2) m(a(xi) - a(x 2 )) + n 2 ((a(x 2 ) - a(x 3 )) = 

n i( a ( x i) ~ °(4)) + n' 2 (a(x' 2 ) - a(x' 3 )) 

in variables bounded by N, and n,n' ^ 0. For a(x) = x 2 the number 
of solutions of (|1.2| ) is ^> A^ 7 . This is consistent with the heuristic that 
zero is a typical value of the difference of the two sides of the equation, 
and for a(x) growing as slowly as x 2 the size of this difference is at 
most 0(N 3 ) while the number of variables is 10. Thus the typical 
difference should occur about iV 7 times. As is explained in ||, this 
effect causes the variance of Rs(f,N) to blow up like N. A similar 
effect will cause the blow-up of the variance of high correlations for 
any polynomially increasing sequence. The non-Gaussian distribution 
of the "theta sums" ^= J2 X<N exp(2wiax 2 ) is related to this kind of 
clustering effect ||3], ^jfl. 

In contrast, for lacunary sequences we will show in section |2| that the 
number of solutions of ( |1.2j ) is 0((N log N) 5 ), which is not much more 
than the number of "diagonal" solutions x = x', n = n'. 

1.4. Plan of the paper. We begin in section |2| with a key counting 
argument: We consider the number of solutions of an equation 

(1.3) mi(a(ii) - a(x 2 )) H h m fc _i(a(x fc _i) - a(x k )) 

= mi(a(xi) - a(x 2 )) H h m'^a^x'^) - a(x' k )). 

in integers ^ m, m' G [— N, A^] fe_1 , x, x' G [1, A^] fc , xi, ■ ■ ■ , x^ distinct, 
x[, ■ ■ • , x' k distinct. In Lemma |2.4j we show that the number of such 
solutions is 0(N 2k ~ l log 2fc_1 N). This is comparable to the number of 
"diagonal" solutions, which is of order N 2k ~ x . For fixed coefficients m, 
m', the diagonal solutions are indeed responsible for the bulk of the 
solutions, see e.g. H. 
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We then show in section Q that the mean of Rk{f, N) is asymptotic 
to J f, and in section |4] we show that the variance decays with N: 
var(i?fc(/, N)) <C N~ 1+e , for all e > 0. These are done by a reduction 
to the study of solutions of ( |1.3| ) . 

In section |^ we show almost-everywhere convergence, after first in- 
vestigating in section [5] the frequency of occurrence of fractional parts 
of aa(x) in short (of size 1/N) intervals. 



2. A COUNTING LEMMA 

Let a{x) be a lacunary sequence, that is there is some c > 1 so that 

a{n + 1) > ca{n) 

for all n sufficiently large. We wish to estimate the number of solutions 
of an equation such as ( |1.3|) . We will do so in Lemma [2.4| , after some 
preliminaries. 

Lemma 2.1. Let s > I, C > and let A 1 > A 2 > ■ ■ ■ > A s be 

positive integers. Then for any b G Z and N > 1 the number of vectors 
y = (yi, • ■ ■ , y s ) G Z s with \yx\, ■ ■ ■ , \y s \ < N such that 

(2.1) \y lAl + ... + y sAs + b \ <CAt 

is Os^N*- 1 ). 

Proof. We need to count the number of integer points y inside the 
region flcR s which consists of the points in the cube [-N, N] s which 
lie between the hyper-planes 

y x A x + ■■■ + y s A s + b = CA X 
^ ' ' y\A x + ■■■ + y s A s + b = -CA X . 

Note that the region Q is convex and contained in a ball around the 
origin of radius -C s N. By the Lipschitz principle (see @) we know 
that 

(2.3) #(HnZ s ) = vo\{Q) + O s {N s - 1 ). 

The distance between the above hyper-planes is 

2C '± <2C 

thus fl is contained in a cylinder of height 2C whose base is an (s — 1) — 
dimensional ball of radius <^C S N. Therefore vol(fi) = O s 5 c(A^ s_1 ) which 
together with (|2.3|) gives the lemma. □ 
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Lemma 2.2. Let s > 2 and z\ > ■ ■ ■ > z s be positive integers. Then 
for any b,d G Z and any N > 1 i/ie number of vectors y = (y±, ■ ■ ■ ,y s ) G 
Z s u>z't/j • • • , \y s \ < N for which 

\yia{zi) H h y s a(2; s ) + b\ < Ca(z 1 ) 

2/1 H h y s + = 

/ioWs irne zs SjC (A^ s_2 ). 

Proof. We first remark that since s > 2 and the z's are distinct, the 
hyper-planes ( p.2|) with Ai, • • • , A s replaced replaced by a(zi), • • • , a(z s ) 
are not parallel to the hyper-plane given by the equation (|2.4|) . More- 
over, the fact that our sequence is lacunary insures that the angle 
between these hyper-planes is not small. Thus when we solve for y s in 
(|2.4j ) and input the result in fl2.1|) we get an inequality in s — 1 variables: 

(2.5) 

\yx(a(zi) - a(z s )) H h y s -i(a(z s -i) - a(z s )) + b- da(z s )\ < a(zi) 

in which the RHS is bounded by the largest of the coefficients which 
appear in the LHS: 

a(zi) - a{z s ) > (1 - ^)a(^i) . 



Then Lemma |2.1| applies to (|2.5| ), with Aj = a(zj) — a(z s ) for 1 < j ' < 
s — 1 and C — (l — -) , and we find that the number of vectors y 
having the required properties is O s ^ c (N s ~ 2 ) as stated. □ 

We now come to our main counting lemma. 

Lemma 2.3. Let r > 1 be an integer. For any N > 1 the number of 
solutions (yi, ■ ■ ■ ,y r ,Zi,--- ,z r ) to the system: 

yia(zt) H h y r a(z r ) = 

2/i H h Vr = 



(2.6) 

in integers yi, • • • ,y r , 



( yi ,---y r )^(0,--- ,0) 
Zi, - • • i z r > 1 distinct 

\vi\, ■ • , |yr|, N, •• • , kr| < iv 



zs O^^^-Mog"- 1 ^). 

Proof. Our proof is by induction on r. The case r = 1 is clear, the num- 
ber of solutions in this case being zero. Let us assume that the state- 
ment holds true for r — 1 and prove it for r. Let (y±, ■ ■ ■ , y r , Zi, ■ ■ • ,z r ) 
be a solution to the system ( |2.6j ). If there exists j G {1, • • • ,r} such 
that yj = then (y u ■ ■ ■ , y^i, y j+1 , ■■■ ,y r ,z 1 ,--- , z j+1 , ■■■ ,z r ) 
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will be a solution for the same system with r replaced by r — 1. By 
the induction assumption the number of solutions of this system is 
O r ,c(N r ~ 2 log 7 * -2 N). For each such solution, Zj is free to take values 
< N. Therefore the number of solutions to the system ( |2.6| ) for which 
at least one of yi, • ■ • , y r vanishes is O rjC (iV r-1 log r ~ 2 N). We now count 
the solutions to ( |2.6| ) with yj ^ for all j. There are r! possible or- 
ders for the z's. Let us count the solutions for which z\ > • • ■ > z r . 
Given such a solution (yi, - ■ ■ , y r , Zi, ■ ■ ■ ,z r ) we consider the partition 
of the set {1, • • • , r} as a disjoint union of sets B±, ■ ■ ■ ,Bi defined as 
follows. B\ consists of those j G {1, • • • , r} for which Zj > Z\ — 2 \°^ ■ 
If j 2 is the smallest index not contained in B\ then we put in B 2 all 
those j G {j 2 , • • • ,r} for which Zj > Zj 2 — 2 1 °| e , and so on . In the 
end, if 1 = ji < j2 < ■ ■ • < ji are the smallest indices contained in 
Bi, B 2 , ■ ■ ■ ,B[ respectively, then we have: 

21ogiV 21ogiV 
[Z. () z h < z h — — s z j2 -i, ■ ■ ■ , z k < z h _ x — — < Zj,-!. 

The number of partitions as above is bounded in terms of r. Let us 
count the number of solutions (yi, ■ ■ ■ ,y r ,Zi,--- ,z r ) which correspond 
to a given partition B\, ■ ■ ■ ,Bi. We distinguish two cases: j^B\ > 2 
and #5; = 1. 

Let us first treat the case j^B\ > 2. If we fix Zj 1} Zj 2 , ■ ■ ■ Zj { then from 
(|2.7|) it follows that each of the remaining z's can take at most f 2 '"^ ] 



values. Hence the number of vectors z — (zi, ■ • ■ ,z r ) satisfying (|2.7|) is 
O r)C (N l log r_z N). Thus we are done with the case j^Bi > 2 if we show 
that for any vector ifas above the number of solutions y = (yi, • • • , y r ) 
is TyC (N r ~ l ~ 1 ). Fix some such z and note that by (|2.7|) one has: 

a( Zl ) _ / \ , a ( z h-i) 



( 2 - 8 ) < z h) < -p-> • " • > °(%J < ^2 • 

Let us take a solution y and look at its first j 2 — 1 components. These 
are nonzero integer numbers in the interval [-N, N] satisfying the in- 
equality: 

|yio(zi) H \- y j2 -ia(z h -i)\ = | ^Vja(zj)\ < rNa(z j2 ) < a{z x ). 

Here we may apply Lemma |2.1| with s = j 2 — 1, b = and A±, ■ ■ ■ ,A S 
replaced by a(zi), • • • , a(z s ) to conclude that the vector (yi, • • • , y^-i) 
can only take O r {N^ 2 ~ 2 ) values. Let us fix (yi, ■ ■ • , yj 2 -i) and count the 
number of solutions y whose first j 2 — 1 components are yi, ■ • • , yj 2 -i- 
We are now interested in those components yj of y for which j G B 2 . 
Write b = y\a{zi) + • — h%- 2 -ia(% 2 -i) and use ( |2.8| ) to deduce that for 
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any solution y, its components yj with j G B 2 satisfy the inequality: 

\ b + Yl yA z j)\ = l yM z i)\ < a (% 2 )- 



By Lemma |2.1| we know that as y varies, the vector formed with the 
components yj of y for j G B 2 can only take O r (iV*' B2_1 ) values. We 
now repeat the above reasoning until we get to the last set of com- 
ponents of y, namely the j/'-s with j G B\. The components yj with 
j < ji being fixed, write b = Ei<j<j ; Vj a ( z j), d = J2i<j<j t Vj and then 
apply Lemma ^72] (here one uses the assumption that j^Bi > 2). It 
follows that the vector formed with the components yj, j G Bi of y can 
take O r ^ c (N# B '~ 2 ) values only. The number of solutions y for a fixed 
fas above is then < r , c JV(#Bi-i)+--+<#Si-i-i)+(#Si-2) = N*- 1 ' 1 , which 
completes the proof in case > 2. 

Assume now that #Bi = 1. Then j/ = r. In this case we fix 
Zi, • ■ ■ , ;z r _i only . This can be done in O rfi (N l ~ l log r_z AT) ways. For 
Zi, • • • ,z r _i fixed we apply Lemma repeatedly to conclude that 
as the vector (yi, ■ ■ ■ ,y r , z r ) varies in the set of solutions, the vector 
(2/i, • ■ • , Vr-i) can take O r)C (Art# B i- 1 )+"~K#Si-i- 1 )) = O r ^N r ~ l ) values 
only. Now for y%,--- , y r -i, • • • , z r -i fixed, y r and uniquely 
determined from the last two relations in ( |2.6|) (here one uses the fact 
that y r 7^ 0). Thus the number of solutions (yi, ■ • ■ ,y r ,Zi,-- - ,z r ) 
is O r>c (N r ~ l log r_i N) in case j^B\ = 1 as well, and the lemma is 
proved. □ 

We intend to use the above counting lemma to bound the number of 
solutions of the following equation: 

(2.9) mi(a(ni) - a(n 2 )) H h m k ^i(a(n k -i) - a(n k )) 

= m'^ain^) - a(n' 2 )) + ■■■ + m^_ 1 (a(n^_ 1 ) - a{n' k )). 

in variables mi, • • • , m k -i, m[, ■ ■ • , vn! h _ x £ Z, m, • • ■ , n^, n^, • • • , 7*4 G 
N, ni, • • • , Uk distinct, n[, ■ ■ ■ ,n' k distinct, 

(mi, • • • , m fc _i, m[, ■ ■ ■ , m' h _^) ^ (0, ■ • • , 0) 

and all variables of absolute value at most N. 
The result we obtain is the following: 

Lemma 2.4. Let k > 2, k G Z. For any N > 1 me number of solutions 
to the system (p|) O k}C (N 2k - 1 log 2 ^ 1 A 7 "). 

In order to simplify the combinatorics involved in the derivation of 



Lemma [2.4| from Lemma |2.3| we first establish a more general form of 
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Lemma |2.3j . Let r > 1 and consider the system: 

2/ia(^i) + ■ ■ ■ + y r a(z r ) = 

yi + • • • + Vr = o 

y u ■•• ,y r G Z, z x , • • • ,z r G N 

• • • > \Vr\, \zi\, • • ■ , \Zr\ < N 



Let (y, z) = (yi, ■ • ■ , y r , zi, ■ ■ ■ , z r ) be a solution of (|2.10|) . For any 



i G {1, • • ■ , r} denote A(i) — {1 < j < r : Zj — Zi}. We say that the 
solution (y, z) is degenerate provided we have 

(2-11) E% =0 

jeA(i) 

for all % G {1, • • • ,r}. Otherwise we say that (y , z) is non- degenerate. 
We have the following : 

Lemma 2.5. Let r > 1. T/ien /or any iV > 1 i/ie number of non- 
degenerate solutions to ( |2.10| ) is O rfi (N r ~ l log r ~ 1 N). 

Proof. Each solution (jj, z) to ( |2.10| ) produces a partition of the set 
{1, • ■ ■ , r} as a disjoint union of subsets A%, ■ ■ ■ , A h where A 1: ■ ■ ■ ,Ai 
are the above sets A(l),--- ,A(r) without repetitions. Let us count 
the number of non-degenerate solutions to ( 2 . 1 (J| ) which correspond to 



a given partition A±, ■ ■ ■ , Ai of the set {1, • • • , r}. For s = 1, 2, • • • , I de- 
note u s = J2j<=A s Vii v s = z j f° r j £ A s , then write u 



(«!,••• ,Vg). If (y, z) is a non-degenerate solution to ( |2.10|) then not all 



(2.12) 



the numbers Ui, • • • , u s vanish. One sees that for any such (?/, z) the 
pair (m, v) is a solution of the system: 

U\a{zi) + • ■ ■ + Uia(zi) = 

U\ H h = 

in integers «i, • • • , G Z, w ^ 0, v i, • • • ,vi <E N distinct, 

|«i|, ■ ■ • , \ui\, |vi|, ■ ■ • , < AT 

By Lemma |2.3| we know that the number of solutions of the system 
(|27T2l ) is 0j >c (iV'- :L log I ~ 1 iV). Now fix a solution (u,v) and count the 
number of non-degenerate solutions (y, z) to Q2.10D which correspond 
to the above partition Ai, ■ ■ ■ ,Ai and which produce the vector (u, v). 
Clearly z is uniquely determined since Zj = v s for any s and any j G A s . 
Moreover, for any s the number of solutions yj,j G A s of the equation 
J2jeA s yj = u s IS Or(^ #As_1 ). Hence the number of solutions (y, z) 
which correspond to a given pair (u,v) is O r (N^ Al ~ 1 " )A K# A s-i)^ _ 
O r (N r ~ l ) and so the total number of non-degenerate solutions to ( |2.10| ) 
is O r (iV r ~ 1 log r_1 N), which completes the proof of Lemma [2.5| . □ 
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Proof of Lemma |2.4| : Denote r = 2k, Z\ = n\, ■ ■ ■ , z k = n^, Zk+i 



n[, ■ ■■ , z r = n' k , yi = mi, y 2 = m 2 - m x , • • • , y k -i = m k -i - m k - 2 , yu = 
-m k -i,yk+i = -m\,yk+i = m[ - m' 2 , ■ ■ ■ ,y 2 k-i = rn' k _ 2 ~ m 'k-i and 
y k = m' k _ v Then any solution (fh, n, fh', n') of (|2.9|) produces a solution 
(y, z) of ( p.lOj ) (with N replaced by 2N) which satisfies the additional 
properties: 

(2.13) 2/iH hyjfe = 

with (yi, ■ ■ ■ , y r ) ^ (0, • • • ,0), zi, ■ ■ ■ ,Zk distinct, Zk+i, ■ • • z r distinct, 
and each such (y, z) uniquely determines the tuple (m, n, fh! , ft'). 

Thus we are done if we show that the number of solutions to ( |2.10|) 
which satisfy the additional requirements (|2.13|) is O r (N r ~ l log r ~ 1 N). 
Lemma |2.5| takes care of the non-degenerate solutions to (|2.10|) so it 



remains to count the number of degenerate solutions to Q2.10 ) which 
satisfy (^131) . 

Let (y, z) be such a solution. If distinct then by the 

degeneracy conditions (|2.11| ) it follows that yi = y 2 — • • ■ = y r — 
which contradicts (|2.13| ). Thus some Zj with 1 < j < k will have to 



equal some Zj with k + 1 < j < 2k. Let s be the number of indices 
j G {1, • • • , k} for which there exists i G {k + 1, ■ ■ • , 2k} such that Zj = 
Zi. Both (|2.10|) and ( |2.13|) are symmetric in Zi, • • • , z k and separately 



in Zfe+i, • • • , z 2 k and the same holds true for yi, ■ ■ ■ ,yk respectively 
yk+i, ■ ■ • , Vik- After making a permutation of variables if necessary, we 
may assume that Zj = Zj + k for 1 < j < s. Then the sets A\, • • • ,A\ look 
like this : A x = {1, k + 1}, A 2 = {2, k + 2}, • • • , A s = {s, k + s}, A s+1 = 
{s + 1}, • - • , A k = {k}, A k+1 = {k + s + 1}, - ■ ■ ,A l = {2k}, where 
/ = 2k — s. The degeneracy relations ( |2.11| ) become: 



(2.14) 



yj + y j+k = 0, 1 <j <s, 

yj = 0, s + 1 < j < k or k + s + 1 < j < 2k . 



Now, given an s G {1, • ■ ■ , k} and the above partition A\, ■ • ■ ,A\, the 
number of degenerate solutions (y, z) which correspond to this partition 
are counted as follows. On one hand each of the / distinct z's can 
assume at most N values , so z takes at most iV' = N 2k ~ s values. On 
the other hand, each of the variables yj (if there are any) with 2 < j < s 
assumes at most 2iV + 1 values and for each such choice of the vector 
(y 2 , ■■■ ,y s ) the varia bles y k+2 , yk+s, y s +i, ■ ■ ■ , Vk , Vk+s+i, ■ ■ ■ ,y 2 k are 
determined by ( |2.14j) , then y\ is determined by ( |2.13|) and the remaining 
variable yk+i is determined by (|2.14|) . Hence y takes at most (2A^+1) ,S_1 



values and the number of degenerate solutions (y, z) is O r (N 2k x ), which 
completes the proof of Lemma [2.4|. □ 
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3. The average value of R k (f,N) 
3.1. Poisson sum. Recall that for / G C^°(R fc_1 ), y G R fe_1 , we set 
Fn(v)= E /(iV(y + m)). 

By Poisson summation, 

(3-1) ^) = ^zr E f(^)e(n-y). 



By inserting (|3.1|) into the definition ( |1 . 1|) of Rk(f,N) we find: 

(3.2) R k (f,N)(a) = ±- k E /(^)El^ e (^- A ^))- 

Since R k (f,N)(a) is periodic in a, we may expand it in a Fourier 
series 

(3.3) R k (f,N)(a) = -Lj2 b &N)e(la) 

zez 

where 

bd,N)= E /(£)■ 

c „ fc _i n-A(x)=l iV 

3.2. The mean of Rk{f, N). From ( |3.3|) we can immediately compute 
the mean of Rk{f, N) as 



E(R k (f,N))= / R k (f,N)(a 



6(0, iV) 
N k 

Lemma 3.1. Assume a(x) is a lacunary sequence. Then Ve > ; 
E(R k (f,N)) = b -^l = f(0) + O u ( ' > 



N k ■> V > Wtfl-e' 

Proof. We write 

6(0, N) = /(0)#{arj < iV : distinct} + b(N) 

= f(0)N k + 

where 

(3-4) w = EE*^ /(£) 

we will show that b(N) <C jV fc ~ 1+e and thus prove our lemma. 
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Fix e > 0, and let 5 = e/2(k - 1), R > (100 + k)/8 + k. Since 
/ G C^°(R fe ~ 1 ), <C \x\~ R for large \x\. Now divide the range of 

summation in (13.41) into < \n\ < N 1+s and \n\ > N 1+s : 



n <-R 



kN)« f E E ^i + E,^ E \jf 

The second sum is bounded by 

jyk+R \ „ jyk+R-(l+6)(R-k) „ jyfe-100 

| n |>ATi+i 1 1 

by our choice of 5 and i£. 

As for the first sum, it is bounded by the number of x = (x\, . . . , Xk) 
with distinct Xi < N 1+s , and n E Z fe_1 with < |n| < N 1+s such that 
n-A(x) = 0. By Lemma |2~3|, this number is < (iV 1+5 log(iV 1+<5 )) fc_1 < 
iV fc - 1+e . Thus we find that 6(iV) < jV fc - 1+e as required. □ 

4. Estimating the variance 
Proposition 4.1. TTie variance of Rk(f, N) satisfies 

vzr{R k (f,N)):= [ \R k (f,N)(a)-E(R k (f,N)\ 2 da^ t 
Jo 

for all e > 0. 



N 1 ' 



Proof. By fl3.3|) we have 



var(i2fc(/, iV)) = E(|i2 fc (/, iV) - 6(0, N)\ 



2\ 



(4.1) 

Moreover, 



N 2k 



n-A(x)=Z=n'-A(a:') 



Now summing over all / 7^ we get 



(4.2) = E E 

Z^O n-A(x)=n'-A(x') 

Fix e > 0, and choose 5 = e/2/c and i? sufficiently large in terms of 
k and 5, say i2 > 2k + (4k + W0)/6. Also set M = N 1+s . We have 
f(x) <C for large x. In (|4.2j) we break up the sum over n into 

ranges < |n| < M and |n| > M, and likewise for the sum over n' . In 
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the range < \n\ < M we use the bound |/(^)| <C 1, and in the range 
|n| > M we use /(x) <C |x| _i? . This gives 

(4.3) 

$>(/, ao 2 «/ XT^^ < w> M ^ ■ A ( x ) = n ' ■ A (* 



<Xi<N *-^x'.<N 



E- V — V — V V — V I ' ' 
Xi<N £-^x'.<N 'N 
0<\n\<M \n'\>M 



n , D \ , n' 



\n\>M \n'\>M 



-R 



The third term in ( |4.3[ ) is bounded by square of the number of Xj < A" 

— |2L|-H 
|n|>Af I JV I ' 



times the square of the sum Yl\n\>M 1 77 1 H ' gi vm S a total of at most 



N 2k N 2R M -2(R~k) < ^-100 

The second term in ([4.3|) is bounded by 



iV 2fe #{M<M} ^ l-^c^V- 1 -^' 

|n'|>M 

^ pj2k+{l+5)(2k-l)-R8 „ ^y- 100 



The first term of ( |4.3|) is bounded by the number of solutions of the 
equation n ■ A(x) = n' ■ A(x') in variables < \n\, \n'\ < M, Xj < M 
distinct, x'- < M distinct. By Lemma TA , this number is at most 

Thus we find that 

and inserting into (fOI) we get 

var( J R fc (/,iV))«iV- 1+e . 

□ 



5. Small fractional parts 

Our next goal will be almost-everywhere convergence. Preliminary 
to that, we have to investigate the frequency of occurrence of fractional 
parts of aa(x) in short (of size 1/N) intervals. We denote by ||x|| the 
distance to the nearest integer. Our principal result in this section is: 
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Proposition 5.1. Let a(x) be lacunary and let c > 1 be such that 

a(x + 1) > ca(x) 

for all x. Then for almost all a the following holds true: For any e > 
there exists a constant C depending only on c, a and e such that for 
any positive integer N and any real number (5 one has: 

#{x < N : \ \aa(x) - (3\\ < 1/N} < CN £ . 

We first prove the following : 

Lemma 5.2. Let N > 1 and a±, • ■ ■ , d k positive integers such that 
a j+i > Naj for 1 < j ' < k — 1. Then the set 

A(a,N) = {ae[0,l];\\a aj \ \ <^,l<j<k} 



has Lebesgue measure < 



Proof. Let a G A(a,N). For 1 < j < k we write a in the form 

cij 

with bj = bj(a) G {0, 1, • • • , dj} and f3j < From aaj = bj + ajj3j, 
with bj G Z and djbj G [— |, |] it follows that 1 1 cm.,- 1| = and since 

I \ aa j\\ < jj we get I A/ 1 < for 1 < j < k. For any j G {1, • • • , k} let 

Bj = {0 < b < dj : there is a G A(a, iV) with bj(a) = b}. 
Then for any j 

A (s ,iv)cu [^-^.^ + ^]=^. 

In particular one has: 



beBj 1 1 1 r 



2 

meas(A(a, TV)) < me&s(A k ) = #-£>*;. 

a k N 

It remains to bound #B k . In order to do this we produce for any j an 
upper bound for j^Bj in terms of j^Bj_\. Let b G Bj. There is a such 
that bj(a) = b. Write: 

a = — +Pj = ^=^ + /3 j - 1 . 
cij dj-i 

Then one has : 

djbj-i . . „ „ . .1 1 . dj 1 

1 dj-i 1 n 3 n ~ jy Ndj-! Ndj' Ndj-i N 
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For a fixed value of bj_i the integer b may vary in the above interval 
of length ^j- + f , so it takes at most 1 + [^2z_ + |] < 2 + 
values. Hence: 

#B ^ 2 ( 1 + 7v^)# B -" 2£ ^ fc 

Clearly < (1 + ai). By multiplying these inequalities we obtain: 

# Bt <2-(l + ai )(l + ^.-(l + ^) 
and therefore 

meas(A(a, N)) < + 0l )(l + £-) • • • (1 + -^-) 

a^A/ ivai iVafc-i 

2 fe 1 + ai A^ai + a 2 Ndk-i + a k 
N k ai a 2 a fc 
Here we use the assumption that iVa, < a J+ i to conclude that 

measA(a, JV) < 

which completes the proof of the lemma. □ 

We now introduce some notation. Given N > 1 and a, (3 G [0,1] 
denote 

G(N, a, (3) = #{* < N; \\aa(x) - (3\ \ < 1}. 



Then set: 



G(N,a) = max G(N,a,8). 

0e[o,i] 



Given 5 > and N > 1 define the set : 

A(5, N) = {ae [0, 1] : G(N, a) > N s }. 

Note that by the above definitions, if a is not in the exceptional set 
A(5, N) then G(N, a) < N s so uniformly for all (3 one has G(N, a, (3) < 
N 5 , i.e. 

#{ x <N;\\aa(x)-P\\<^}<N s 

for all (3. Set A(8, M) = \Jn>m a ( s , n ) and M 5 ) = flM>i^ M )- 
Now let a A(5). Then there exists M = M(a, 5) such that a is not 
in A(S,M). Thus for any N > M(a,S) we have a A(S,N) and so: 
For any iV > M(a, 5) we have uniformly for all (3 : 

#{x<N;\\aa(x)-(3\\<^}<N s . 
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In other words, if a A(5) then there exists C(5, a, c) such that for all 
N and all ft one has : 

#{s < N; | \aa(x) - ft\ \ < — } < C(S, a, c)N 5 . 



In order to prove Proposition |5.1| we need to show that for any 5 > 
the set A(5) has measure zero. Fix 5 > 0. By the definition of A(S) 
one has me&s A(5) < me&s A(5, M) for any M > 1, so it is enough to 
show that: 

(5.1) meas A(5, M) — > as M oo. 

Now meas A(5, M) < Y1n>m meas A(S, N). Thus in order to prove 
( |5.1| ) it is enough to show that there exists es > such that for any 
N > 1 one has: 

(5.2) meas A(S, N) < c , 5 

We will prove this in the next Lemma, which completes the proof of 



Proposition 5.1 



Lemma 5.3. Given 5 > 0, for any N > 1 one has 

1 



meas A (5, N) < 



c.5 



jyl999 ' 

Proof. Given 5 > we choose a positive integer k, depending on 5 only, 
whose precise value will be given later. Let N > 1 and a G A(S,N). 
There exists ft G [0, 1] such that the set 

M = { x < N; \\aa(x) - ft\\ < -j^} 

has more than [N 5 ] elements. Arrange the elements of M in increasing 
order: {1 < x\ < £2 < • • ■ < x{\ and pick from this set the first 
element xi, then ignore the next r = [(1 + 5) log c N] elements, pick the 
next one, ignore again r elements, and so on. We get a set of "well 
spaced" integers M. = {yi — x\ < y% = x r+ i < y 3 < ■ ■ ■ < y s } with 

S - l+(l+5)log c AT ' SUCn that 

\\ aa (Vj) ~ P\\ < Jj, l<J<s 

and (since yj + i — yj > (1 + S) log c N): 

(5.3) a(y J+1 )>N 1+5 a( yj ), l<j<s-l. 

Now look at the sequence of fractional parts U = ({aa(y :( )})i<j< s . 
They all fall in an interval of length centered in {ft}. We cut this 
interval in m = [£=-] intervals Ji, • • • , J m having the same length: 
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By the box principle, one of these intervals, Jj say, will contain at least 



— = ttItt > 7§=T7 > k elements of U, that is, J; n will contain at least 

fc + 1 elements of W. So let z < z i < ■ ' • < z k be + 1 elements of 
M. for which the fractional parts {aa(z )}, ■ ■ ■ , {aa(z k )} belong to J io . 
Then clearly one has: 

(5.4) \\a{a{z x ) - a(^ ))H, • • • , \\a(a(z k ) - a(z ))\\ < length] J io \ 

2 2 < 4A; < 4fc(l + (1 + 5)log c iV) 1 



~~ Nm N[*=±] ~ Ns ~ N 1+s " 

for iV sufficiently large in terms of c, and 5. Note also that since the 
Zi are still well-spaced, by ( |5.3| ) one has: 

(5.5) ofa) > iV 1+5 a(z ), • • • , a(z k ) > N 1+S a(z k ^) . 
Let a = (ai, . . . , afc) be given by: 

a x = a(zi) - a(z ), • • • , a k = a{z k ) - a(z ) . 
By ( |5.5|) we see that for i = 1, • • • , k — 1 one has: 

(5.6) a i+ i = a(z i+1 ) - a(z ) > A^ 1+l5 a(z i ) - a(z ) 

> N 1+5 (a( Zi ) - a(z )) = N 1+S ai 

while (|5.4|) says that 



(5.7) ||«cJi|| < — , 1 < i < k. 

N 1+ 2 

From (|5.6|) and ( |5.7|) we see that one may apply Lemma 5^2 to the 
vector a, with N replaced by N 1+ ? . In the terminology of that Lemma, 
a belongs to A (a, N 1+ z). Since for each a G A(5,N) there is such a 
vector a it follows that 



A(5, N) C IjA^AT 1 - 



By Lemma |5.2| we derive: 

measA(5,iV) < ^ meas A(a, iV 1+ i) < 4 . 

a 

Now each vector a as above is uniquely determined by a (k + l)-tuple 
(zo, Zi, . . . , Zk) of positive integers < N. The number of such (k + 1)- 
tuples is < N k+1 . It follows that 

N k+1 N 
me&s A(8,N) < C)M = — 

We now let k = and the lemma is proved. □ 



SPACINGS OF FRACTIONAL PARTS OF LACUNARY SEQUENCES 17 



6. Almost everywhere convergence 

We now show that there is a set of a of full measure so that for 
all k > 2 and all test functions / £ C^°(R fc_1 ), the fc-level correlation 
functions R k (f, N)(a) converge to J f(x)dx. The main ingredient here 
is: 

Proposition 6.1. Fix f £ (7~(R fc_1 ). If < 5 < 1 and 1 < K < 

N 1 ^ 6 then for almost every a 

R k (f,N + K)(a)-R k (f,N)(a)^0 
6.1. Proof of Theorem |1.2| . We first show how Proposition |6J] im- 



plies Theorem |1.2| : By Proposition for fixed / we have 
1 \R k (f,N)(a) - E(R k (f,N))\ 2 da « e N'"/ 100 
and so if we take N m ~ 772 101 / 99 then 

/ J2\Mf,N m )(a)-E(R k (f,N)\ 2 da 

= Y, f \Rk(f,N m )(a)-E(R k (f,N)\ 2 da 

< m ioi/ioo < 00 

m 

Thus the sum \R k (f, N m )(a) — E(R k (f, N)\ 2 is finite almost every- 
where, and hence the individual summands converge to zero as m — » 00 
for almost all a. 

For each iV we can find m such that N m < N < iV m+1 . Then since 
R k (f, N m )(a) — E(R k (f, N) — > for almost all a, and by Lemma |Q| , 



E(R k (f,N) -> /(0), Proposition |J will show that R k (f,N)(a) -> 
/(0) for a set of full measure of a which depend on the test function 
/. By a standard diagonalization argument one can pass to a subset 
of full measure of a's which work for all /'s (see [§]). □ 

6.2. An upper bound for R k (f,N). As a consequence of Propo- 
sition lTT we have the following a-priori estimate on the correlation 
functions: 

Lemma 6.2. For almost all a we have 



R k (f,N)(a)« eJ N' 
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Proof. We use the representation of R k (f,N) as in ( |1.1| ): 

iv z — <Xi<N 

where A(x) := (a(xi) — 0(^2), . . . , a(x k ~i) — a(x k )). Note that 

\R k (f,N)\ <R k (\f\,N) 

so we may assume / > 0. Now fix an, and set /3 = aa(xi); then for 
aA(i) to lie in the support of F N , we need ||aa(x 2 ) — /3|| <C/ 1/iV. By 
Proposition |5.1| , for almost all a there are at most Of }£ (N £ ) integers 
X2 < N satisfying this. Similarly, we need ||aa(xj) — (3\\ <C/ 1/N for 
all 2 < i < k which forces the number of possible x = (xi,...x k ) 
contributing to the sum to be at most 0(N e ). Now summing over the 
N possible an's gives R k (f, N)(a) < e ,/ N e . □ 

6.3. Proof of Proposition |6.1| . Now fix < 5 < 1 and assume that 
K < N 1 ^ 6 . We will show that for almost all a, 

\R k (f, N + K)(a) - R k (f, N)(a)\ < KN~ 1 ^ 

Step 1: In the expression 

we can replace 1/(N + fC) by 1/N with error O e j(KN~ 1+e ) . 
Indeed,by Lemma |T2|, -Rfc(/, AT + K) < iV e and so 

i El,< K+K f »«(» A w) = & + £ >*</. " + A ") 

= B fc (/,iV + A')+0(^]V') 

as claimed. 

Step 2: VKe may replace the sum over (distinct) Xj < N + K by the 
sum over (distinct) Xi < N: 

J21^ n+k Fn + k(^(x)) = J21^ n Fn + k(^)) + 0(KN £ ) . 

Indeed, the difference between the two sums is a sum over a union of 
subsets 

S(I) = {(an, ...,x k ) distinct : N < Xi < N + K,i e I,Xj < N,j 1} 

where the index set I runs over all the 2 k — 1 nonempty subsets of 
{1,2,..., A;}. 

To estimate the contribution of S(J) := Yj X eS(i) ^JV+ic(o;A(ar)), we 
use the consequence of Proposition |5.1| , which says that if we fix one 
of the coordinate axes io, then the number of vectors x with x, = y 
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fixed which contribute to the sum is 0(N e ), uniformly in y. Thus the 
number of vectors in S(I) which contribute to the sum £(/) is at most 
0(KN e ), because if we look at i G / we have N < x io < N + K for 
x G S(I), and so 

£(/) < />6 KN e max |/| < KN e . 

Thus we find 

R k (f, N + K)- R k (f, N) = ±-Y*^ F N+K (aA(x)) - F N (aA(x)) 

iV * — 'xi<N 

+ O u (KN- 1+ *). 
Step 3: We s/iow £/ia£ /or almost all a, 

jf El i<iV *W« A (*)) - i^(«A(x)) « /TAT 1 "* . 

Remark: This is the statement that the correlation functions are inde- 
pendent of the exact unfolding procedure! 

First, a digression: Given a vector y G R fc_1 , there is a unique 
integer vector m y G Z fc_1 so that y + m y lies in the cube (— 1/2, l/2] fc_1 . 
Moreover, for any other integer vector m 7^ m s , | \m + y\ \ > 1/2 and so 
||iV(m + y)|| > N/2. Thus if A" is sufficiently large so that supp(/) lies 
in a ball of radius p(f) < N/2 around the origin, then 

F N (y) = f(N(m y + y)) 

and 

(6.1) \\N(m y + y)\\<p(f). 

Furthermore, if m ^ m y then \\(N+K)(m y +y)\\ > \\N(m+y)\\ > N/2 
and therefore 

F N+K (y) = f((N + K)(m y + y)). 
Apply these considerations to y — aA(x) and abbreviate 

v x ■■= m aA(x) + aA(x) 
to get that if JV > JV (/) then 
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By the mean value theorem, 

f((N + K)v x ) - f(Nv x ) = Kv x ■ Vf(Nv x + 6Kv x ) 

(6.2) R K 

= -Nv x -Vf(Nv x (l + 9-)) 

for some < 9 = 9 X < 1 depending on x. If this is nonzero, then 
certainly Nv x is contained in a ball of radius 2p(f) around the origin. 
Now ll-ZVupll < p(f) by ( |6.1| ), so the sum of the terms ( |6.2j ) is bounded 
by p(f) max ||V/|| times the number of x for which Nv x lies in a ball 
of radius 2p(f) around the origin. 

We can now bound the sum of ( |6.2|) by relating it to a smoothed k- 
level correlation function as follows: Choose a positive, smooth function 
g G C^°(R ) which is constant on the ball of radius 2p(/) around the 
origin, and satisfies g > max||V/||. Write G^{y) := J2 m g(N(m + y))- 
Then 

Vf(Nv x (l + 9^)) < g(Nv x ) = G N (aA(x)) . 
Thus we find that 



1 X ^ 

TV 2-*>xi 



<N 



f((N + K)v x ) - f(Nv x ) « §p(f)^ J2l i<N GN(aA(x)) 



N 
K 



N 



p(f)R k (g,N) 



By Lemma |6.2| , Rk{g } N) -C 9 , e N e for a.e. a, which gives the result of 



step 3. This concludes the Proof of Proposition 5.1 



□ 
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